Recombinant acquired immune deficiency syndrome (AIDS) viral envelope
protein and method of testing for AIDS.

Recombinant envelope proteins of the etiologic agent of acquired immune
deficiency syndrome (AIDS) useful in diagnosis and therapy of AIDS and a
method for their preparation are described. Proviral DNA is transferred
into a host cell after engineering into an expression vector which
produces the envelope protein. A method of testing human blood for the
presence of antibodies to the AIDS virus using the AIDS envelope protein
is also described.
lymphadenopathy-associated virus (LAV), AIDS-associated retrovirus (ARV)
and the human T-cell leukemia/lymphoma/lymphotropic virus type III
(HTLV III).

A further aspect of this invention relates to a diagnostic method for
testing human blood for the presence of antibodies to the env AIDS
protein. This aspect of the invention overcomes the problems of all
previously used blood tests for AIDS. One of the problems is the use of
compositions to bind AIDS antibody which contain proteins or peptides
which were not derived solely from the AIDS etiologic agent. A
composition using homogeneous envelope AIDS protein of this invention
overcomes the nonspecificity of the prior tests or assays. Yet another
aspect of this invention is a diagnostic method for detecting and/or
determining the presence of the antigen in human blood.

Another aspect of this invention is to use the env AIDS proteins of the
instant invention as antigens suitable for providing protective immunity
against AIDS when incorporated into a vaccine.

Brief Description of the Drawings 



Fig. 1. The nucleotide sequence of the envelope gene of the HTLV-III
proviral genome (HXB-3).

Fig. 2. Comparison of the amino acid sequence of the env protein of the
five purported etiologic agents of AIDS. Amino acid sequences are
aligned to give maximum homology.

Fig. 3. Construction of the pEV/env44-640 expression plasmids. The upper
left panel shows a simplified restriction site map of the 3.15 Kb
EcoRI-XhoI segment of the HTLV-III genome which contains the env coding
region (cross-hatched arrow). The right panel shows the structure and
pertinent sequences of the pEV-vrf plasmids. The solid black region
represents the synthetic ribosome binding site sequences upstream of the
ATG initiation codon (overlined). See Example 2 for a detailed
description of the env expression plasmid constructions.

Fig. 4. Western blot analysis of env coded antigens produced in E. coli.
Total bacterial proteins were resolved by SDS-PAGE, electro-blotted onto
a nitrocellulose filter, and env encoded proteins were detected by
reacting with human sera as described in Example 5: a) negative control,
cells containing pJCL-E30 (p21T) induced at 42°C for 2 hours; b)
uninduced control, cells containing pEV3/env44-640 maintained at 30°C;
c) pEV3/env44-640; d) pEV1/env44-640; and e) pEV3/env205-640 induced at
42°C for 2 hours.

Fig. 5. Recognition of bacterially synthesized HTLV-III env gene
products by antibodies in AIDS patient sera. Bacterial lysates
containing recombinant env proteins were subjected to Western blot
analysis as described in Example 5. Individual strips were then
incubated with a 1000-fold dilution of individual sera followed by
treatment with labeled protein A. (Upper part) Serum samples were from
the following donors: (lane 1) normal healthy donor; (lanes 2-18) AIDS
patient sera collected from the West Coast of the USA. (Lower part)
Serum samples were taken from the following donors: (lane 1) donor found
to be HTLV-I(+) by ELISA using disrupted virus; (lanes 4, 5, 11 and 15)
healthy, normal donors; (lanes 2, 3, 6, 8, 10, 12, 13, 14, 16, 17 and
18) AIDS patient sera from the East Coast of the USA.

Fig. 6A. The amino acid sequence of the 
AIDS envelope protein. 

Fig. 6B. The amino acid distribution of the 
AIDS envelope protein. 

Fig. 7. Construction of the expression vector pRC23. The Shine-Dalgarno
sequence (SD) is overlined and the location of the synthetic ribosome
binding site sequence in the plasmid is represented by the solid black
segment. The plasmid contains the entire sequence of pBR322 and thus
confers resistance to both ampicillin (ampR) and tetracycline (tetR).

Fig. 8. Construction of the pEV-vrf vectors. The synthetic
oligonucleotides for each plasmid which were placed downstream of the SD
sequence in pRC23 are shown with the locations of the restriction enzyme
cleavage sites. The ATG initiation codon is overlined, and the placement
of the additional A-T base pairs is designated by the rectangle. The
plasmids confer resistance to ampicillin only.
cytopathic characteristic of HTLV III was critical to determining
ultimately the specific retroviral origin of the disease. Thus the
etiologic agent of AIDS was isolated by use of immortalized human
neoplastic T cell lines (HT) infected with the cytopathic retrovirus
characteristic of AIDS, isolated from AIDS afflicted patients.
Seroepidemiological assays using this virus showed a complete
correlation between AIDS and the presence of antibodies to HTLV III
antigens [Sarngadharan, M.G. et al., supra; Schupbach, J. et al.,
supra]. In addition, nearly 85% of patients with lymphadenopathy
syndrome and a significant proportion of asymptomatic homosexual men in
AIDS endemic areas were also found to carry circulating antibodies to
HTLV III. Taken together, all these data indicate HTLV III to be the
etiologic agent for AIDS.

Until the successful culturing of AIDS virus using the H-9 cell line
[PCT application, publication no. WO 85/04897] the env AIDS protein of
the AIDS virus had not been isolated, characterized or synthesized. This
in major part is due to the fact that the virus is cytopathic and thus
isolation of the virus was not possible [Popovic, M. et al., supra].
Once the human T-cell line resistant to the cytopathic effects of the
virus was discovered, a molecular clone of proviral DNA could be
achieved.

The need for a sensitive and rapid method for the diagnosis of AIDS in
human blood and its prevention by vaccination is very great. Virtually
all the assays/tests presently available are fraught with errors. In
fact the Center for Disease Control (CDC) has indicated that presently
available tests be used solely for screening units of blood for antibody
to HTLV III. The CDC went further by stating that the presently
available ELISA tests cannot be used for general screening of high risk
populations or as a diagnostic test for AIDS [Federal Register 50(48),
9909, March 12, 1985]. The errors have been traced to the failure to use
a specific antigenic protein of the etiologic agent for AIDS. The
previously used proteins were derived from a viral lysate. Since the
lysate is made from human cells infected with the virus, i.e. the cells
used to grow the virus, the lysate will contain human proteins as well
as viral proteins. Thus preparation of a pure antigen of viral protein
is very difficult. The antigen used produced both false positive and
false negative results [Budiansky, S., "AIDS Screening, False Test
Results Raise Doubts", Nature 312, 583 (1984)]. The errors caused by the
use of such lysate proteins/peptides can be avoided by using a
composition for binding AIDS antibodies which is substantially free of
the non-AIDS specific proteins. Compositions that are substantially pure
AIDS envelope protein can be used as antigens.

The AIDS envelope protein of the instant invention has been established
to have conserved epitopes which permit its use to screen for, diagnose
and/or prevent by vaccination the infection by AIDS virus. The instant
invention demonstrates that the envelope protein with its conserved
epitopes includes all the variants which have been claimed as the sole
etiologic agent.

The envelope AIDS protein of the present invention may be produced by
conventionally known methods. The processes by which the novel protein
may be produced can be divided into three groups: (1) chemical
synthesis; (2) preparation of a gene prepared by chemical synthesis
which is inserted into a host and a protein is produced by the host and
(3) a corresponding gene obtained biotechnically is inserted into a host
and a protein is produced by the host.

In one embodiment of this invention, recombinant DNA techniques are
utilized by which env AIDS DNA from a natural source is introduced into
a cell to produce the env AIDS protein. One method of obtaining DNA
which encodes env AIDS is to read the genetic code in reverse and
synthesize an oligodeoxynucleotide which should encode the env AIDS
amino acid sequence. As the env protein has not been isolated or
characterized this approach cannot be pursued.

Alternatively, gene expression can be obtained using recombinant DNA
technology if DNA isolated from natural sources is used instead of
synthetic DNA.

Summary of the Invention 

This invention is directed to the engineering of the HTLV III env gene
into suitable expression vectors; transformation of host organisms with
such expression vectors; and production of envelope AIDS protein (env
AIDS) by culture of such transformed cells. Another aspect of the
present invention relates to the isolation and use of the resulting
recombinant env AIDS protein.

Another aspect of the present invention is the identification and
determination of the proviral DNA sequence. More specifically, this
aspect of the invention relates to determination and comparison of the
proviral nucleotide sequence of the envelope genes of the purported
etiologic agent of AIDS i.e.
M = Methionine; N = Asparagine; P = Proline; Q = Glutamine;
R = Arginine; S = Serine; T = Threonine; V = Valine; W = Tryptophan;
Y = Tyrosine; Z = Glutamine or Glutamic Acid.

In accordance with the present invention, the search for the envelope
protein of the etiologic agent for acquired immune deficiency syndrome
(AIDS) has led to the isolation and sequencing of the proviral gene of
the AIDS virus. It has now been discovered, for what is believed to be
the first time, that the postulated etiologic agents of AIDS,
lymphadenopathy-associated virus (LAV), AIDS-associated retrovirus (ARV)
and human T-cell leukemia/lymphoma/lymphotropic virus (HTLV III) are in
fact variants of the same virus. For purposes of this invention, in the
specification and claims the virus causing AIDS will be referred to
herein as AIDS virus. AIDS virus will be understood to include the
variants which have been postulated as the causative agents of AIDS,
namely LAV, ARV and HTLV III. The envelope protein of the AIDS virus
(env AIDS) is a 97,200 dalton protein with 32 potential N-glycosylation
sites. Nucleotide sequence analysis of the AIDS envelope gene of the
putative etiologic agents of AIDS demonstrates that all the viruses are
variants of the same virus. That means that there is approximately 1 to
20% divergence or variation between the sequence of the envelope gene of
HTLV III and the sequences of the envelope genes of the other viruses
LAV and ARV-2. The amino acid sequence of the env AIDS is set forth in
Figure 6(a). The amino acid distribution is set forth in Figure 6(b).
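
A count of potential N-glycosylation sites such as the one quoted above
can be reproduced from an amino acid sequence by scanning for sequons.
The following is a minimal sketch, assuming the conventional sequon
definition Asn-X-Ser/Thr with X other than Pro, which the text itself
does not spell out. The function name and the short peptide are
hypothetical illustrations and are not taken from Figure 6(a).

```python
import re

# Conventional N-glycosylation sequon: Asn, any residue except Pro,
# then Ser or Thr. The lookahead allows overlapping sequons to be found.
SEQUON = re.compile(r"N(?=[^P][ST])")

def n_glycosylation_sites(protein):
    """Return 1-based positions of Asn residues that start an N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]

# Hypothetical toy peptide in one-letter code (not the env sequence).
toy_peptide = "MNATKNPSLNNTS"
sites = n_glycosylation_sites(toy_peptide)
print(sites, len(sites))
```

Applied to the full one-letter sequence of Figure 6(a), the length of the
returned list would be the number of potential sites under this
definition.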

The nucleotide sequence of the envelope gene is shown in Figure 1. The
proviral DNA sequence was analyzed, using methods known to one of
ordinary skill in the art such as the chemical degradation method of
Maxam and Gilbert or the M13 sequencing system of Messing, which is a
modification of the dideoxy nucleotide chain termination method of
Sanger, to determine the location of the region coding for the envelope
protein. The location of an open reading frame, i.e. a long stretch of
triplet codons not interrupted by a translational stop codon, for the
envelope gene was determined. The open reading frame coding for the env
gene is 863 amino acids long and contains an ATG codon at the eighth
position from the 5' end of the reading frame. The ATG codon is known to
be a universal translation-initiation codon.
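
The open reading frame search described above can be sketched in a few
lines. This is an illustration only, assuming a forward-strand scan of
all three frames for the longest run of codons uninterrupted by TAA, TAG
or TGA; the function names and the 27-nucleotide toy sequence are
hypothetical and are not taken from Figure 1.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def longest_orf(dna):
    """Return (frame, start, end) of the longest stop-free codon run.

    Indices are 0-based, end is exclusive; only the given strand is scanned.
    """
    dna = dna.upper()
    best = (0, 0, 0)
    for frame in range(3):
        run_start = frame
        for i in range(frame, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                if i - run_start > best[2] - best[1]:
                    best = (frame, run_start, i)
                run_start = i + 3
        # A run may also reach the end of the sequence without a stop codon.
        end = frame + ((len(dna) - frame) // 3) * 3
        if end - run_start > best[2] - best[1]:
            best = (frame, run_start, end)
    return best

def first_atg_codon(dna, start, end):
    """Return the 1-based codon position of the first in-frame ATG, or None."""
    for n, i in enumerate(range(start, end, 3), start=1):
        if dna[i:i + 3].upper() == "ATG":
            return n
    return None

# Hypothetical toy sequence: a stop, seven open codons, then a stop.
toy = "TAGGCCATGCTAAAGGGTGACTGTTGA"
frame, start, end = longest_orf(toy)
print(frame, (end - start) // 3, first_atg_codon(toy, start, end))
```

For the env gene the analogous result would be a run of 863 codons whose
first ATG sits at codon position 8.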

The integrated proviral genome of HTLV-III was cloned from the genomic
DNA of H9 cells infected with HTLV-III [Shaw, G.M. et al., "Molecular
characterization of Human T-cell leukemia (lymphotropic) virus type III
in the acquired immune deficiency syndrome", Science 226, 1165-1171
(1984)]. Since the HTLV-III provirus was found to lack XbaI restriction
sites, a genomic library was constructed by using XbaI digested
H9/HTLV-III DNA. There are several methods available to one of ordinary
skill in the art for screening the bacterial clones containing the AIDS
env protein cDNA. These include, for example, RNA selection
hybridization, differential hybridization, hybridization with a
synthetic probe or screening for clones that produce the desired protein
by immunological or biological assays. From the genomic library,
colonies of cells transformed with DNA that contains the HTLV III
sequences were selected by hybridization screening of the library with
HTLV III cDNA. The DNA insert of the hybridization-positive clone,
HXB-3, was excised from the plasmid DNA and sequenced.

The predicted product of the env gene shares many features in common
with the envelope gene products of other retroviruses. Thus, a
hydrophobic region is seen in the middle of the protein (amino acids
519-534) which includes a processing site for the cleavage of the
precursor protein into exterior and transmembrane proteins. Similarly,
the amino terminal end contains a short stretch of hydrophobic amino
acids (amino acids 17-37) which constitutes a potential signal sequence.
The HTLV-III envelope precursor differs from the other retroviral
envelope protein precursors in that it contains an additional stretch of
180 amino acids at the carboxy terminus.

Polymorphism within the Envelope Region of AIDS Virus

The recent publication of the nucleotide sequences of LAV, ARV-2 and
HTLV-III [Ratner, L., et al., supra; Sanchez-Pescador, R., et al.,
supra; Wain-Hobson, S., et al., supra] allows a detailed comparison of
these various isolates obtained from AIDS patients from different parts
of the world. HTLV-III clones were isolated from AIDS patient
lymphocytes obtained from the east coast of the United States, while LAV
was isolated from a French man and ARV was isolated from a patient in
California. A comparison of the sequence data confirms the earlier
observations made using restriction enzyme site analysis which showed
approximately 10% variation. The present analysis shows that the various
isolates show the greatest amount of conservation in the gag and pol
regions while the most divergence occurs in the env region. A comparison
of the five env sequences is presented in Figure 2. With respect to the
envelope gene, HTLV-III and LAV are more closely related to each other
than the ARV clone. Approximately 1.6%
Detailed Description of the Invention

In the description the following terms are employed:

Nucleotide: A monomeric unit of DNA consisting of a sugar moiety
(pentose), a phosphate, and either a purine or pyrimidine base
(nitrogenous heterocyclic). The base is linked to the sugar moiety via
the glycosidic carbon (1' carbon of the pentose). That combination of a
base and a sugar is called a nucleoside. Each nucleotide is
characterized by its base. The four DNA bases are adenine ("A"), guanine
("G"), cytosine ("C") and thymine ("T").

DNA Sequence: A linear array of nucleotides connected one to the other
by phosphodiester bonds between the 3' and 5' carbons of adjacent
pentoses.

Codon : A DNA sequence of three nucleotides 
(a triplet) which encodes through mRNA an amino 
acid, a translation start signal or a translation ter- 
mination signal. For example, the nucleotide triplets 
TTA, TTG, CTT, CTC, CTA and CTG encode for 
the amino acid leucine ("Leu"). TAG, TAA and 
TGA are translation stop signals and ATG is a 
translation start signal. 

Reading Frame : The grouping of codons during 
translation of mRNA into amino acid sequences. 
During translation the proper reading frame must 
be maintained. For example, the sequence 
GCTGGTTGTAAG may be translated in three read- 
ing frames or phases, each of which affords a 
different amino acid sequence: 

GCT GGT TGT AAG = Ala-Gly-Cys-Lys

G CTG GTT GTA AG = Leu-Val-Val

GC TGG TTG TAA G = Trp-Leu-(STOP)
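
The three readings above can be checked mechanically. The sketch below
is illustrative only: it builds the standard genetic code table from the
conventional TCAG ordering and translates the example sequence in each
of the three frames. The helper name `translate` is hypothetical, amino
acids are reported in one-letter code, and `*` marks a stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna, frame):
    """Translate dna starting at offset frame (0, 1 or 2), stopping at a stop codon."""
    protein = []
    for i in range(frame, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3].upper()]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)

for frame in range(3):
    print(frame, translate("GCTGGTTGTAAG", frame))
```

The printed strings AGCK, LVV and WL* correspond to Ala-Gly-Cys-Lys,
Leu-Val-Val and Trp-Leu-(STOP) respectively.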

Polypeptide: A linear array of amino acids connected one to the other by
peptide bonds between the α-amino and carboxy groups of adjacent amino
acids.

Genome: The entire DNA of a cell, or a virus. It includes inter alia the
structural genes coding for the polypeptides of the substance, as well
as operator, promoter and ribosome binding and interaction sequences,
including sequences such as the Shine-Dalgarno sequences.

Structural Gene : A DNA sequence which en- 
codes through its template or messenger RNA 
("mRNA") a sequence of amino acids characteris- 
tic of a specific polypeptide. 

Transcription : The process of producing mRNA 
from a structural gene. 



Translation: The process of producing a poly- 
peptide from mRNA. 

Expression: The process undergone by a structural gene to produce a
polypeptide. It is a combination of transcription and translation.

Plasmid: A circular double-stranded DNA molecule that is not a part of
the main chromosome of an organism and that contains genes conveying
resistance to specific antibiotics. When the plasmid is placed within a
unicellular organism, the characteristics of that organism may be
changed or transformed as a result of the DNA of the plasmid. For
example, a plasmid carrying the gene for tetracycline resistance (TetR)
transforms a cell previously sensitive to tetracycline into one which is
resistant to it. A cell transformed by a plasmid is called a
"transformant".

Cloning Vehicle: A plasmid, phage DNA or other DNA sequences which are
able to replicate in a host cell, which are characterized by one or a
small number of endonuclease recognition sites at which such DNA
sequences may be cut in a determinable fashion without attendant loss of
an essential biological function of the DNA, e.g., replication,
production of coat proteins or loss of promoter or binding sites, and
which contain a marker suitable for use in the identification of
transformed cells, e.g., tetracycline resistance or ampicillin
resistance. A cloning vehicle is often called a vector.

Cloning: The process of obtaining a population of organisms or DNA
sequences derived from one such organism or sequence by asexual
reproduction.

Recombinant DNA Molecule or Hybrid DNA: A molecule consisting of
segments of DNA from different genomes which have been joined end-to-end
outside of living cells and have the capacity to infect some host cell
and be maintained therein.

The nomenclature used to define the peptides or proteins is that used in
accordance with conventional representation such that the amino group at
the N-terminus appears to the left and the carboxyl group at the
C-terminus to the right. By natural amino acid is meant one of the amino
acids commonly occurring in natural proteins comprising Gly, Ala, Val,
Leu, Ile, Ser, Thr, Lys, Arg, Asp, Asn, Glu, Gln, Cys, Met, Phe, Tyr,
Pro, Trp and His. By Nle is meant norleucine, and by Nva is meant
norvaline. Where L and D forms are possible, it is the L-form of the
amino acid that is represented unless otherwise expressly indicated. In
addition, amino acids have been designated by specific letters of the
alphabet such that A = Alanine; B = Aspartic Acid or Asparagine;
C = Cysteine; D = Aspartic Acid; E = Glutamic Acid; F = Phenylalanine;
G = Glycine; H = Histidine; I = Isoleucine; K = Lysine; L = Leucine;
tion of an appropriate host is also controlled by a number of factors
recognized by the art. These include, for example, compatibility with
the chosen vector, toxicity of proteins encoded by the hybrid plasmid,
ease of recovery of the desired protein, expression characteristics,
biosafety and costs. A balance of these factors must be struck with the
understanding that not all hosts may be equally effective for expression
of a particular recombinant DNA molecule.

A preferred embodiment of the instant invention is to express segments
of the AIDS env protein in E. coli by inserting restriction fragments
isolated from the cloned proviral genome into the versatile pEV-vrf
(variable reading frame) expression plasmids (for details of
construction see Example 2). These versatile pEV-vrf plasmids are
derivatives of pBR322 which contain the phage lambda PL promoter, a
synthetically-derived ribosome-binding site, and convenient cloning
sites (EcoRI, BamHI, ClaI and HindIII) just downstream to the initiation
codon (Figure 8). A set of three plasmids was constructed to accommodate
all three translational reading frames. The PL promoter is regulated by
a temperature-sensitive cI repressor encoded on the compatible plasmid
pRK248cIts [ATCC 33766; Bernard, H.U. and Helinski, D.R., "The use of
the λ phage promoter PL to promote gene expression in hybrid plasmid
cloning vehicles", Meth. Enzymol. 68, 482-492 (1979)]. These expression
plasmids have been used to produce substantial amounts of several
heterologous proteins in E. coli including v-bas p21 [Lacal, J.C. et
al., "Expression of Normal and Transforming H-ras genes in E. coli and
purification of their encoded p21 proteins", PNAS 81, 5305-5309 (1984)]
and murine interleukin-1 [Lomedico, P.T. et al., "Cloning and Expression
of Murine Interleukin-1 cDNA in E. coli", Nature 312, 458-462 (1984)].

In the present synthesis the preferred initial cloning vehicle is the
bacterial plasmid pBR322 (ATCC 37017) and the preferred initial
restriction endonuclease sites therein are the EcoRI and HindIII sites
(Figure 3). Insertion of proviral DNA contained within the genome of H9
cells into these sites provides a large number of bacterial clones each
of which contains one of the proviral DNA genes or fragments thereof
present in the genome of H9 cells. Only a very few of these clones will
contain the gene for env AIDS or fragments thereof.



The preferred host for initial cloning and expression of the env AIDS
gene in accordance with this invention is E. coli MC 1061 [Casadaban,
M.J. and Cohen, S.N., "Analysis of Gene Control Signals by DNA Fusion
and Cloning in E. coli", J. Mol. Biol. 138, 179-207 (1980)].

The coding sequences for amino acid residues #44 to 640 of the env
protein are located downstream of the PL promoter between the KpnI and
HindIII sites on the restriction map as shown in Figure 3. Aside from
the location of these convenient restriction sites, these sequences were
chosen for bacterial expression experiments because they include neither
the amino-terminal signal peptide nor the hydrophobic transmembrane
segment at the carboxyl end. These sequences were excluded to avoid
possible toxicity problems which can occur when hydrophobic proteins are
over-produced in bacterial cells. In a preferred embodiment of this
invention an expression plasmid was constructed that would direct the
synthesis of this segment of the env gene product (designated pEV/env
44-640). An intermediate construction was first made by inserting a 2400
bp EcoRI-HindIII fragment between the EcoRI and HindIII sites in the
pEV-vrf plasmids. The HTLV-III sequences (600 bp) between the EcoRI and
the KpnI site were then removed from the intermediate construction as
shown in Figure 3. These plasmid constructions were carried out with all
three pEV-vrf plasmids so that subsequent deletions could be made and
the correct reading frame maintained. In addition, the constructions
made in the incorrect reading frames served as important controls in the
expression experiments described below.

In another embodiment of this invention, a second set of expression
plasmids was constructed in a similar fashion by deleting sequences
between EcoRI and StuI sites which occur 483 bp downstream of the env
gene. Again these deletions (designated pEV/env 205-640) were made in
all three reading frames. The translation termination codon used in all
of the env expression plasmids is presumably an in-frame TAA located 23
bp downstream of the HindIII site in the plasmid. Thus, 8 amino acid
residues at the carboxyl terminus are encoded by pBR322 sequences
contained within the pEV-vrf expression plasmids.

Expression of ENV AIDS

There are several approaches to screen for bacterial clones containing
env AIDS cDNA. These include, for example, RNA selection hybridization,
differential hybridization, hybridization with a synthetic probe and
screening for clones that produce
divergence was observed between the HTLV-III (HXB-3) and LAV sequence.
Among the HTLV sequences, the divergence was about 1.6%. However,
approximately 17% divergence was observed between HTLV-III and ARV-2 and
this was more pronounced in the extracellular region of the envelope
gene product (Figure 2). This high rate of divergence could be due to
the geographical location from where the two isolates were derived or
the time of isolation of these variants. ARV-2 was isolated from the
west coast of the United States more recently. The HTLV-III isolates for
which the nucleotide sequences have been determined were all obtained
from the east coast of the United States a year earlier. LAV was
obtained from a French patient who appears to have acquired the virus in
New York about the same period. The observed differences in the sequence
probably reflect divergent evolution of strains separated in time or
geography or both. Within the env region, the highest level of
divergence is in the extracellular portion of the protein.
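
Divergence figures of this kind can be computed from a pairwise
alignment such as the one in Figure 2. The sketch below is one plausible
way to do so and is not necessarily the calculation used here: it
assumes the two sequences are already aligned to equal length, that gaps
are written as "-", and that gapped columns are skipped rather than
counted as differences. The function name and the toy sequences are
hypothetical.

```python
def percent_divergence(seq_a, seq_b, gap="-"):
    """Percentage of differing positions between two pre-aligned sequences.

    Columns where either sequence has a gap are ignored (an assumption).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x == gap or y == gap:
            continue
        compared += 1
        if x != y:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared

# Hypothetical toy alignment: one difference in ten compared positions.
print(percent_divergence("ATGAGAGTGA", "ATGAAAGTGA"))
```

The same function works on aligned amino acid sequences, so it can be
applied either to the nucleotide sequences or to the protein alignment.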

Expression Vector 

A wide variety of host/cloning vehicle combinations may be employed in
cloning the double-stranded DNA. For example, useful cloning vehicles
may consist of segments of chromosomal, nonchromosomal and synthetic DNA
sequences, such as various known bacterial plasmids, e.g. plasmids from
E. coli such as pBR322, phage DNA, and vectors derived from combinations
of plasmids and phage DNAs such as plasmids which have been modified to
employ phage DNA or other expression control sequences or yeast
plasmids. Useful hosts may include microorganisms, mammalian cells,
plant cells and the like. Among them microorganisms and mammalian cells
are preferably employed. As preferable microorganisms, there may be
mentioned yeast and bacteria such as Escherichia coli, Bacillus
subtilis, Bacillus stearothermophilus and Actinomyces. The
above-mentioned vectors and hosts may also be employed for the
production of a protein from a gene obtained biologically as in the
instant invention. Of course, not all host/vector combinations may be
equally efficient. The particular selection of host/cloning vehicle
combination may be made by those of skill in the art after due
consideration of the principles set forth without departing from the
scope of this invention.

Furthermore, within each specific cloning vehicle, various sites may be
selected for insertion of the double-stranded DNA. These sites are
usually designated by the restriction endonuclease which cuts them. For
example, in pBR322 the EcoRI site is located just outside the gene
coding for ampicillin resistance. Various sites have been employed by
others in their recombinant synthetic schemes. Several sites are well
recognized by those of skill in the art. It is, of course, to be
understood that a cloning vehicle useful in this invention need not have
a restriction endonuclease site for insertion of the chosen DNA
fragment. Instead, the vehicle could be joined to the fragment by
alternative means.

The vector or cloning vehicle and in particular the site chosen therein
for attachment of a selected DNA fragment to form a recombinant DNA
molecule is determined by a variety of factors, e.g., number of sites
susceptible to a particular restriction enzyme, size of the protein to
be expressed, susceptibility of the desired protein to proteolytic
degradation by host cell enzymes, contamination of the protein to be
expressed by host cell proteins difficult to remove during purification,
expression characteristics, such as the location of start and stop
codons relative to the vector sequences, and other factors recognized by
those of skill in the art. The choice of a vector and an insertion site
for a particular gene is determined by a balance of these factors, not
all selections being equally effective for a given case.

There are several known methods of inserting DNA sequences into cloning
vehicles to form recombinant DNA molecules which are equally useful in
this invention. These include, for example, direct ligation, synthetic
linkers, exonuclease and polymerase-linked repair reactions followed by
ligation, or extension of the DNA strand with DNA polymerase and an
appropriate single stranded template followed by ligation.

It should, of course, be understood that the nucleotide sequences of the
DNA fragment inserted at the selected site of the cloning vehicle may
include nucleotides which are not part of the actual structural gene for
the desired polypeptide/protein or may include only a fragment of the
complete structural gene for the desired protein. It is only required
that whatever DNA sequence is inserted, a transformed host will produce
a protein/peptide having an immunological activity to the AIDS env
protein or that the DNA sequence itself is of use as a hybridization
probe to select clones which contain DNA sequences useful in the
production of polypeptides/proteins having an immunological activity to
the AIDS env protein.

The cloning vehicle or vector containing the foreign gene is employed to
transform a host so as to permit that host to express the protein or
portion thereof for which the hybrid DNA codes. The selec-
tion. No reaction was observed with sera from healthy individuals or
from HTLV-I infected individuals. The patient sera were derived from all
parts of the United States including California and all AIDS patients'
sera tested so far were found to be positive. The results suggest that
these antibodies are mainly directed against the protein backbone of the
molecule.

It appears, therefore, that the env gene products constitute the best
diagnostic reagents for the detection of AIDS associated antibodies. The
env gene product of the instant invention encompasses a large portion of
the protein molecule and contains both the conserved and divergent
portions of the molecule. In spite of the divergence observed between
HTLV-III and ARV-2 sequences the recombinant env proteins of the instant
invention synthesized by the bacteria react with AIDS patient sera
derived from both geographical locations of the United States. One
hundred percent (100%) of AIDS patient sera (50 individual samples, 25
derived from the East Coast of the United States and 25 derived from
California) tested showed high reactivity. This is strong evidence for
the presence of conserved epitopes within the molecule against which the
immune system could mount an antibody reaction. The human immune system
may thus be mounting an immune response against conserved epitopes of
the envelope molecule, as suggested by the reactivity of the AIDS
patient sera. The observed divergence between various isolates of
HTLV-III thus may not pose a problem for the use of recombinant protein
as a vaccine. The 68Kd protein is ideally suited for such a purpose
since it encompasses a large portion of the gene product and has the
unique structural feature of containing both the extracellular
hydrophilic region and the membrane associated hydrophobic regions. This
structural feature makes it well suited for encapsulation into liposomes
which have been used as vehicles for vaccination against other viral
envelope proteins.

Based on these discoveries it is proposed that 
in the practice of screening blood for AIDS only 
AIDS envelope protein or a variant of said protein 
be utilized. Utilizing the env AIDS protein of the 
instant invention, human blood can be screened for 
the presence of antibodies to the AIDS virus. This 
and other techniques are readily determined, once, 
as taught for the first time by the present invention, 
the envelope AIDS protein has been recognized to 
be the envelope protein of the etiologic agent of 
AIDS. The foregoing and other objects, features 
and advantages of the invention will be apparent 
from the following examples of preferred embodi- 
ments of the invention. 



Example 1 



Molecular cloning and nucleotide sequence analysis
of the HTLV-III proviral genome.

The integrated proviral genome of HTLV-III was
recently cloned from the genomic DNA of H9 cells
infected with HTLV-III [Shaw, G.M. et al., supra].
The proviral genome, which was obtained by using
XbaI-digested H9/HTLV-III DNA, contained two internal
EcoRI sites within the viral genome and two
additional sites in the cloning vector λJ1. These
sites were used for further subcloning of the three
DNA fragments of 5.5 Kb, 4.5 Kb and 1.1 Kb into
pBR322 (ATCC No. 37017). Nucleotide sequence
analysis of the proviral genome was carried out by
the chemical degradation method of Maxam, A.M.
and Gilbert, W., "Sequencing end-labelled DNA
with base-specific chemical cleavages", Meth. Enzymol.
65, 499-560 (1980). For the sequence analysis,
DNA inserts from the three subclones were
isolated by electroelution and further cleaved with
appropriate restriction enzymes. The DNA fragments
were labelled at their 5' ends with γ-32P-ATP
using polynucleotide kinase, or at their 3' ends with
α-32P-NTP by filling in with DNA polymerase I
(Klenow fragment). The DNA fragments labelled at
the two ends were cleaved with a second enzyme,
and the fragments labelled at a single end were
purified on 5% acrylamide gels and used for sequence
analysis. For the sequence analysis of the
env gene, a shotgun approach was utilized where
the 4.5 Kb EcoRI fragment was cleaved with one of the
following enzymes: BglII, HindIII, XhoI, AvaII, HinfI
and Sau3A, and the restriction fragments were labelled
and sequenced as described above. The
nucleotide sequence of the envelope gene used in
the present invention is shown in Figure 1.


Example 2 



Construction of pEV/env 44-640


pRC2 is a derivative of pBR322 containing a
unique BglII site adjacent (on the ampR side) to
the EcoRI site in the plasmid. This plasmid was
constructed in the following manner. 20 µg of
pBR322 plasmid DNA were digested with EcoRI
and then split into two reactions. In one, the protruding
5' single-stranded termini were removed
with S1 nuclease; in the other reaction, the termini
were filled in by incorporating deoxynucleotides with
the Klenow fragment of DNA polymerase I. Both
reactions were terminated by phenol extraction fol-
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the desired protein by immunological or biological 
assays. Two methods are available to screen using
immunological assay: screening of bacterial colonies
for the presence of protein using antibody;
and, preferably, the bacterial lysates are electrophoresed,
blotted onto a nitrocellulose paper and
then probed with the antibody.

In a preferred embodiment of this invention,
cultures of the E. coli strain MC 1061 transformed
with pRK248cIts and the pEV 1, 2, or 3/env 44-640
(or pEV 1, 2 or 3/env 205-640) were grown in M9
medium at 30°C to mid-log phase and then induced
by shifting to 42°C for 2 hr. Samples of the
bacterial cultures were then taken and subjected to
SDS-polyacrylamide gel electrophoresis, followed
by Western blot analysis to detect env proteins.
The protein blots were treated with antisera to env
AIDS proteins isolated either from immunized rabbits
or from AIDS patients previously shown to
contain high titer antibodies to AIDS antigens. This
was followed by incubation with 125I-labelled
Staphylococcus aureus protein A, washing and
autoradiography. Similar results were obtained with
both sera except that the human serum was found
to contain much higher titers of anti-HTLV-III antibodies
and was devoid of all background reactivity
with the E. coli proteins. For this reason human
antibodies were used in all subsequent characterization.

Figure 4 shows the pattern of reactivity of the
env AIDS proteins synthesized in bacteria
(recombinant proteins) with anti-HTLV-III antibodies.
The open reading frame in pEV3/env 44-640
encodes a protein that should migrate as a 68
Kd band on the gel. In fact a 68 Kd band is
observed in the lane corresponding to the induced
cells containing pEV3/env 44-640 (lane c). However,
in addition to the 68 Kd band, these cells
synthesized proteins of 35 Kd, 25 Kd and 17 Kd
which specifically cross-reacted with anti-HTLV-III
antibodies. No HTLV-III cross-reacting bands are
evident in the uninduced control (lane b) or in a
second negative control sample (lane a) of induced
cells containing a plasmid that directs the
synthesis of the v-bas p21 oncogene product (Lacal,
J.C. et al., supra). The appearance of multiple
bands synthesized from the env gene sequences
was an unexpected result. Another unexpected result
was the synthesis of env gene products from
the plasmid (pEV1/env 44-640) where the insert
was placed in the wrong reading frame with respect
to the initiator codon immediately downstream
of the PL promoter (lane d). In this case, E.
coli cells containing plasmid pEV1/env 44-640 synthesized
a 63 Kd protein in addition to the 35 Kd,
25 Kd and 17 Kd proteins. These results could be
readily explained when the nucleotide sequence of
the envelope gene (Fig. 1) was examined. About
155 bases downstream of the KpnI site is an ATG
codon which appeared to be utilized for the synthesis
of the env gene product by the expression
plasmid pEV1/env 44-640. Internal translation initiation
is also the likely explanation for the appearance
of the 35 Kd, 25 Kd and 17 Kd proteins. Initiation
codons which are preceded by so-called
Shine-Dalgarno sequences (AGGA) are found within
the env coding region at locations that are consistent
with the sizes of the observed protein products.
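
The internal-initiation argument can be illustrated with a short scan for ATG codons that lie a few bases downstream of an AGGA motif. This is only a sketch: the function name, the 4 to 12 base spacing window and the toy sequence are assumptions introduced here for illustration, not values taken from the specification. A real analysis would be run on the Figure 1 sequence.

```python
import re

def find_internal_starts(dna, sd_motif="AGGA", min_gap=4, max_gap=12):
    """Return (atg_position, gap) pairs for each ATG preceded by sd_motif.

    gap is the number of bases between the end of the motif and the A of
    the ATG. Positions are 0-based. The spacing window is an assumed,
    adjustable heuristic, not a figure taken from the source text.
    """
    dna = dna.upper()
    hits = []
    for match in re.finditer("ATG", dna):
        atg = match.start()
        for gap in range(min_gap, max_gap + 1):
            start = atg - gap - len(sd_motif)
            if start >= 0 and dna[start:start + len(sd_motif)] == sd_motif:
                hits.append((atg, gap))
                break  # report each ATG once, with its closest motif
    return hits

# Toy sequence (hypothetical, for illustration only).
toy = "CCCAGGAAACAGCTATGGCC"
print(find_internal_starts(toy))
```

Candidate start sites found this way could then be compared with the observed band sizes by counting the codons that remain between each ATG and the stop codon.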

To confirm the above interpretation and to rule
out the possibility that the smaller proteins are
formed as a result of premature termination or from
proteolytic cleavage of the larger product, another
deletion mutant, in which sequences between the
KpnI and StuI sites were deleted, was constructed.
This expression plasmid contains the coding sequences
from amino acid positions 205-640, which
could code for a protein of 49 Kd. Analysis of the
proteins induced from E. coli harboring this plasmid
verified that, in fact, these cells synthesize a
49 Kd protein in addition to the 35 Kd, 25 Kd and
17 Kd proteins (lane e, Fig. 4). From these results,
it was concluded that the pEV3/env 44-640 expression
plasmid directs the synthesis of a 68 Kd protein in
addition to several smaller polypeptides
(i.e., 35 Kd, 25 Kd and 17 Kd) produced from all of
the env expression plasmids as a result of internal
translation initiation within the env gene.
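
As a rough plausibility check on the stated sizes, the mass of a polypeptide can be estimated from its residue count. The sketch below assumes the common rule of thumb of about 110 daltons per residue; that constant and the function name are assumptions made here, not figures from the specification. The estimates it gives fall within a few Kd of the 68 Kd and 49 Kd values in the text, and any vector-encoded residues would add to them.

```python
AVG_RESIDUE_DA = 110.0  # assumed average mass of one amino acid residue

def approx_kd(first_residue, last_residue):
    """Estimate protein mass in kilodaltons from an inclusive residue range."""
    n_residues = last_residue - first_residue + 1
    return n_residues * AVG_RESIDUE_DA / 1000.0

# Residue ranges as named in the text (env 44-640 and env 205-640).
for first, last in [(44, 640), (205, 640)]:
    print(first, last, round(approx_kd(first, last), 1))
```

Because SDS-PAGE mobility also depends on composition, an estimate of this kind is only a sanity check on the expected band positions.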

Screening of AIDS Sera

Because anti-HTLV-III antibodies are found in
more than 90% of the AIDS patients, it was of
interest to see if the bacterially synthesized env
gene products could be used as diagnostic tools
for the detection of these antibodies. For this analysis,
total cell protein from an induced bacterial
culture was fractionated by SDS-PAGE and transferred
to a nitrocellulose filter by the Western blotting
technique. Strips of the filter containing transferred
proteins were reacted with 1000-fold diluted human
sera, and the antigen-antibody complexes formed
were detected by incubation of the strips with
125I-labelled Staphylococcus aureus protein A followed
by autoradiography. Prominent bands corresponding
to reaction of the antibody with the 68 Kd, 35 Kd,
25 Kd and 17 Kd proteins were consistently observed
when the serum used was from patients
with AIDS syndrome. The results of such assays
with different human sera are presented in Figure
5. The negative controls used were normal human
sera and serum from a patient with HTLV-I infec-
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gested with Stul. Again, upon recircularization and 
blunt-end ligation, the EcoRI site was regenerated; 
however, an additional 483 bp of env coding sequences
were removed.

Example 3 



Bacterial Growth and Induction of env Gene Expression

Cultures of E. coli strain MC 1061 transformed
with plasmid pRK248cIts and the pEV1, -2, or
-3/env plasmids were grown in M9 medium containing
0.5% glucose and 0.5% casamino acids at
30°C to mid-log phase and then induced by shifting
to 42°C for 2 hr. The cells were collected by
centrifugation and processed as described in Examples
4 and 5.

Example 4 



Expression and Purification of Env AIDS 

A homogeneous recombinant viral env AIDS protein
was purified according to the following procedure.
The env AIDS protein expressed by a microbe 
tends to associate with the membrane fractions of 
the host microbe, principally the inner membrane 
of the microbe. The following purification method 
was designed to deal with this finding. 

This purification method comprises: 

(a) lysis of transformed microbial cells pro- 
ducing recombinant env AIDS protein; 

(b) separation of env AIDS associated cel- 
lular membranes from other cellular compo- 
nents; 

(c) extraction of env AIDS from associated 
membranes; and 

(d) chromatographic purification of the resul- 
tant extraction solution containing env AIDS 
to yield a substantially pure recombinant 
viral env protein. 

More specifically, the preferred purification 
method for the preparation of substantially pure 
recombinant viral env protein comprises: 

(a) cultivating a transformed organism con- 
taining a DNA sequence which codes for 
viral env protein; 



(b) causing a culture of the transformed or- 
ganism of step (a) to accumulate the env 
protein; 

(c) lysing the culture of transformed organisms
of step (b) to form a cell lysate mixture;

(d) isolating the cell membrane components
of the cell lysate mixture of step (c);

(e) washing the isolated cell membrane components
with an extraction solution to yield a
wash solution containing env protein; and

(f) chromatographically purifying the wash
solution of step (e) to yield a substantially
pure env AIDS protein.
In carrying out this method it is preferred that
the cells be lysed by sonication, although it is
foreseeable that other known methods such as enzymatic
or mechanical lysis could also be used. It is
preferred that the cell membrane components, specifically
the inner and outer membranes, be isolated
from other cellular components by methods
such as centrifugation. It has been found that env
AIDS expressed by the transformed microorganism
tends to become associated with the cellular membranes.
Therefore, isolation of these membranes
during the purification process ensures high purification
levels and high purity env AIDS at the end
of the purification procedure.

Once the cell membranes are isolated from the
lysate mixture, they are washed with an extraction
solution, preferably salt solutions and a detergent,
to yield a second solution containing approximately
50% env AIDS protein. Preferably the cell membranes
are washed in four separate steps with the
salt solutions and detergent, although it is foreseeable
that certain of these steps could be combined,
rearranged or eliminated. The first step of washing
the cell membrane may be done with a salt solution,
preferably 1M NaCl. In the second step the
cell membrane is washed with a detergent solution,
preferably 1% Triton X-100. In the third step, the
cell membrane is washed with another salt solution,
1.75M to 3.5M guanidine HCl. The final wash is
also with a salt solution, preferably about 7M
guanidine HCl. The wash solution which results
from the fourth and final wash comprises about
50% env AIDS.

The final 50% env AIDS wash solution is then
further purified by a chromatography step, preferably
reverse phase high performance liquid
chromatography (HPLC). The HPLC step yields
env AIDS protein in a substantially 100% pure
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lowed by ethanol precipitation. Approximately 1 ug 
of DNA from each reaction was mixed with 90
pmoles of phosphorylated BglII linkers
(CAGATCTG, purchased from Collaborative Research)
and incubated with T4 DNA ligase at 15°C
for 18 hours. The ligation products were then digested
with BglII and PstI and subjected to gel
electrophoresis in 1% agarose. The 3600 bp and
760 bp fragments from both reactions were recovered
from the gel. For the construction of pRC2,
the 3600 bp fragment from the Klenow reaction was ligated
to the 760 bp fragment from the S1 reaction. To
construct a plasmid with the BglII site on the other
side of EcoRI (tetR side), designated pRC1, the
3600 bp fragment from the S1 reaction was ligated
to the 760 bp fragment from the Klenow reaction.
E. coli strain RR1 (ATCC No. 31343) was transformed
with the ligation mixtures, and transformants
were selected on LB agar plates containing
50 µg/ml ampicillin. Transformants containing the
expected plasmid constructions were identified by
restriction analysis of the isolated plasmid DNA.
DNA sequence analysis confirmed that the S1
nuclease treatment precisely removed the 5'
single-stranded termini.

pRC23 (see Figure 7) was constructed by inserting
into pRC2 a 250 bp BglII-HaeIII fragment
containing the λ PL promoter joined to a pair of
complementary synthetic oligonucleotides comprising
a model ribosome-binding site (RBS). The
HaeIII site is located within the 5' non-coding region
of the λ N gene 115 bp downstream of the
PL transcriptional initiation site. Approximately 1 µg
of a 450 bp BglII-HpaI fragment isolated from
phage λ DNA was digested with HaeIII. 200 ng of
the resulting digestion products were mixed with 60
pmoles each of phosphorylated synthetic
oligonucleotides containing the model RBS. The
ligated molecules were digested with BglII and
EcoRI and separated on a 5% polyacrylamide gel.
The 270 bp ligation product was recovered from
the gel, mixed with gel-purified pRC2 vector that
had been digested with BglII and EcoRI, and incubated
with T4 DNA ligase at 15°C for 15 hours.
The ligation mixture was used to transform strain
RR1(pRK248cIts). Transformants selected on
ampicillin-containing medium were screened by restriction
analysis of the isolated plasmid DNA. The
expected plasmid construction, pRC23, was confirmed
by further restriction enzyme digestions and
by DNA sequence analysis across the EcoRI junction
(Fig. 7).

For the construction of the pEV-vrf set of plasmids
(see Figure 8), plasmid pRC23 was digested
with EcoRI and HindIII and the pRC23/EcoRI-HindIII
vector isolated by preparative agarose gel electrophoresis.
The mixture of synthetic
oligonucleotides (32, 33, and 34 nucleotides) was
combined with the mixture of the complementary
sequences, heated to 58°C for 5 minutes in 150
mM NaCl, and cooled slowly to allow annealing. 0.1
pmoles of the synthetic duplexes were added to
0.07 pmoles of the pRC23/EcoRI-HindIII vector and
incubated with T4 DNA ligase at 15°C for 15 hours.
Strain RR1 (λ cI857) was transformed with the
ligation products. Six ampicillin-resistant transformants
were selected for DNA sequence analysis.
Of the six, two contained the expected sequence
for pEV-vrf1, one for pEV-vrf2, and three for
pEV-vrf3 (Fig. 3).

For the expression of the AIDS env gene, one
µg of a 2400 bp EcoRI-HindIII DNA fragment,
which was isolated from the cloned HTLV-III proviral
genome by preparative agarose gel electrophoresis,
was mixed with 0.1 µg of EcoRI-HindIII
digested vector DNA (pEV-vrf1, -2, or -3).
After heating at 65°C for 3 minutes, the mixtures
were chilled on ice, and 20 µl ligation reactions
were assembled, containing 50 mM Tris-HCl (pH
7.4), 10 mM MgCl2, 10 mM DTT, 0.3 mM ATP, and
200 units of T4 DNA ligase. After incubation at
15°C for 4 hours, the reactions were terminated by
heating at 65°C for 5 minutes. The ligation products
were used to transform E. coli strain MC1061
containing plasmid pRK248cIts. Transformants
were selected on Luria broth agar containing 50
µg/ml ampicillin at 30°C for 18 hours. Plasmid
DNA was isolated from 1 ml of each culture and
subjected to restriction analysis. All 12 isolates
contained the expected plasmid construction.
These intermediate constructions were then used
to make pEV1, -2, and -3/env 44-640 by deleting
the 600 bp between the EcoRI and KpnI sites as
described below.
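
The DNA quantities in this ligation are given by mass, whereas ligation stoichiometry is usually considered in molar terms. The sketch below converts between the two under the common approximation of about 650 daltons per base pair. The constant, the function names and the 4000 bp vector length are assumptions made for illustration; the text does not state the vector length.

```python
AVG_BP_DA = 650.0  # assumed average mass of one base pair, in daltons

def dsdna_pmol(mass_ug, length_bp):
    """Convert a mass of double-stranded DNA (micrograms) to picomoles."""
    return mass_ug * 1e6 / (length_bp * AVG_BP_DA)

def insert_to_vector_ratio(insert_ug, insert_bp, vector_ug, vector_bp):
    """Molar ratio of insert to vector in a ligation mix."""
    return dsdna_pmol(insert_ug, insert_bp) / dsdna_pmol(vector_ug, vector_bp)

insert_pmol = dsdna_pmol(1.0, 2400)  # 1 ug of the 2400 bp fragment
# The vector length is not given in the text; 4000 bp is a placeholder.
ratio = insert_to_vector_ratio(1.0, 2400, 0.1, 4000)
print(round(insert_pmol, 2), round(ratio, 1))
```

Under these assumptions the insert is present in clear molar excess over the vector, which is consistent with the tenfold mass excess described.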

Approximately 0.5 µg of plasmid DNA was
digested with KpnI and EcoRI. The resulting termini
were then treated with the Klenow fragment of DNA
polymerase I in the presence of all four deoxyribonucleotides
(at 100 µM) at 37°C for 30 minutes.
This step results in the "filling-in" of the 5'
overhang of the EcoRI terminus and the removal of
the 3' overhang of the KpnI terminus. Upon recircularization
of the linear plasmid and blunt-end ligation
of these termini, an EcoRI site is regenerated.
Transformants containing plasmids with the expected
deletion were identified by restriction analysis.

A second set of deletion derivatives, desig- 
nated pEV/env 205-640 was constructed in a simi- 
lar fashion. A portion of the linear plasmid that had 
been digested with EcoRI and Kpnl and treated 
with Klenow, as described above, was further di- 
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The size and shape of epitopes found in carbohy- 
drate antigens have been extensively studied, but 
less is known about the structure of epitopes from
protein molecules. Some epitopes of protein antigens
have been defined at the level of their tertiary
structure. In every instance, the epitopes were
formed not by the primary sequences alone, but by
the juxtaposition of residues brought together by
the folding of the polypeptide chain(s) of the native
molecule. In addition, the structure of the 68 Kd env
protein of the instant invention makes it particularly
well suited for use as a vaccine. The 68 Kd env
protein comprises a large portion of the gene product
which (a) was shown to be reactive with all the
AIDS sera tested; and (b) has the unique structural
feature of containing both an extracellular hydrophilic
region and the transmembrane hydrophobic
regions. The latter structural feature makes it well
suited for use as a vaccine using liposome encapsulation
to create a vehicle for administration.

Routes of administration, antigen dose, number
and frequency of injections are all matters of optimization
within the scope of ordinary skill in the
art, particularly in view of the fact that there is
experience in the art in providing protective immunity
by the injection of other related antigens to
provide immunity in other viral infections. It is anticipated
that the principal value of providing immunity
to AIDS infection will be for those individuals
who have had no previous exposure to AIDS, e.g.,
individuals who are in the high risk population, such
as homosexuals, drug addicts and people from
Haiti and Central America, and individuals who may
be receiving blood transfusions. It is also anticipated
that temporary immunity for infants may be
provided by immunization of mothers during pregnancy.

Example 7 

Diagnostic Test for AIDS

It is clear that the env gene proteins of the
instant invention may be used as diagnostic
reagents for the detection of AIDS-associated antibodies.
It is also apparent to one of ordinary skill
that a diagnostic assay for AIDS using polyclonal or
monoclonal antibodies to the AIDS env protein of
the instant invention may be used to detect the
presence of the AIDS virus in human blood. In one
embodiment a competition immunoassay is used
where the antigenic substance, in this case the
AIDS virus, in a blood sample competes with a
known quantity of labelled antigen, in this case
labelled AIDS env protein, for a limited quantity of
antibody binding sites. Thus, the amount of labelled
antigen bound to the antibody is inversely
proportional to the amount of antigen in the sample.
In another embodiment, an immunometric assay
may be used wherein a labelled AIDS env
antibody is used. In such an assay, the amount of
labelled antibody which complexes with the
antigen-bound antibody is directly proportional to
the amount of antigen (AIDS virus) in the blood
sample. In a simple yes/no assay to determine
whether the AIDS virus is present in blood, the
solid support is tested to detect the presence of
labelled antibody. In another embodiment, monoclonal
antibodies to AIDS env protein may be
used in an immunometric assay. Such monoclonal
antibodies may be obtained by methods well
known in the art, particularly the process of Milstein
and Kohler reported in Nature 256, 495-497 (1975).
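
The inverse relationship in the competition format can be illustrated with a deliberately simple model. The sketch assumes that labelled and unlabelled antigen bind with equal affinity and that the binding sites are limiting; the function name and the numbers are hypothetical. It shows only the qualitative trend, since a real assay would be read against a standard curve.

```python
def labelled_bound(sites, labelled, sample_antigen):
    """Amount of labelled antigen bound when two antigens share limited sites.

    Assumes equal affinity for both antigens. If the total antigen does not
    exceed the number of sites, all labelled antigen is bound; otherwise
    the sites are shared in proportion to each antigen's abundance.
    """
    total = labelled + sample_antigen
    if total <= sites:
        return float(labelled)
    return sites * labelled / total

# Hypothetical units: 10 binding sites, 100 units of labelled antigen.
for sample in (0, 100, 300):
    print(sample, labelled_bound(10, 100, sample))
```

In this model the bound label falls as the sample antigen rises, which is the behaviour the competition assay relies on.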

The immunometric assay method is as follows:
Duplicate samples are run in which 100 µl of a
suspension of antibody immobilized on agarose
particles is mixed with 100 µl of serum and 100 µl
of soluble 125I-labelled antibody. This mixture is incubated for
specified times ranging from one quarter hour to
twenty-four hours. Following the incubation periods
the agarose particles are washed by addition of
buffer and then centrifuged. After removal of the
washing liquid by aspiration, the resulting pellet of
agarose particles is then counted for bound
labelled antibody. The counts obtained for each of
the complexes can then be compared to controls.
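
The comparison of duplicate counts with controls might be organized as in the sketch below. The function name, the example counts and the twofold cutoff are all hypothetical; the specification does not state a decision threshold, so the cutoff is left as a parameter to be set from validated controls.

```python
from statistics import mean

def score_sample(duplicate_counts, negative_control_counts, cutoff_ratio=2.0):
    """Average duplicate counts and compare them with the negative controls.

    Returns (ratio, is_positive). cutoff_ratio is an assumed, adjustable
    threshold and is not a value taken from the source text.
    """
    sample = mean(duplicate_counts)
    background = mean(negative_control_counts)
    ratio = sample / background
    return ratio, ratio >= cutoff_ratio

# Hypothetical counts per minute for one serum and its negative controls.
print(score_sample([1200, 1300], [240, 260]))
```

Averaging the duplicates before taking the ratio keeps a single outlying tube from deciding the result on its own.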

While the invention has been described in
terms of certain preferred embodiments, modifications
obvious to one with ordinary skill in the art
may be made without departing from the scope of
the invention. For example, it is understood that the
env AIDS DNAs described herein represent only
the precise structure of two naturally occurring
gene segments. It is expected that slightly modified
alleles will be found encoding similarly functioning
proteins, and such gene segments and proteins
are considered to be equivalents for the purpose of
this invention. It is also suspected that other variants
in addition to those described herein will be
found and that the envelope protein of said variants
will differ slightly. These variant envelope proteins
are likewise considered within the scope of the
invention. DNA having equivalent codons is considered
within the scope of the invention, as are
synthetic gene segments that encode homologous
proteins of the viral envelope.

Various features of the invention are set forth in 
the following claims. 
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form. It is also foreseeable that monoclonal anti- 
body affinity chromatography columns utilizing env 
AIDS polyclonal or monoclonal antibodies, could be 
used as an alternative to HPLC. 

Example 5 



Polyacrylamide gel electrophoresis and Western
blot analysis

Cells were lysed by resuspending the cell pellets
(approximately 10' cells) in TG buffer (10 mM
Tris, pH 7.4, 10% glycerol), mixed with an equal
volume of 2 x sample buffer [Laemmli, U.K.,
"Cleavage of Structural Proteins During the Assembly
of the Head of Bacteriophage T4", Nature 227,
680-685 (1970)] and incubated at 95°C for five (5)
minutes. Cell debris was pelleted by centrifugation
and the cleared lysates were subjected to SDS-PAGE
analysis [Laemmli, U.K., supra]. For Western
blot analysis, the proteins from the acrylamide gel
were electroblotted onto a 0.1 µm nitrocellulose
membrane (Schleicher and Schuell) for 16 hr at
50V, in 12.5 mM Tris, 96 mM glycine, 20% methanol,
0.01% SDS at pH 7.5. Processing of the blot
was carried out using the methods described by
Towbin, H. et al. ["Electrophoretic Transfer of Proteins
From Polyacrylamide Gels to Nitrocellulose
Sheets: Procedure and Some Applications", Proc.
Natl. Acad. Sci. U.S.A., 76, 4350-4354 (1979)]. For
treatment with the human sera, the blots were
incubated with a 1000-fold dilution of the sera in
antibody buffer (20 mM sodium phosphate buffer,
pH 7.5, containing 0.5 M NaCl, 1% BSA and 0.05%
Tween 20) for 2-6 hr. The blots were then washed
twice with phosphate buffered saline containing
0.05% Tween 20 and then incubated with
125I-labelled Staphylococcus aureus protein A for an
additional period of 1 hr. The blot was then washed
twice in PBS-Tween 20 buffer, dried and autoradiographed.

Example 6 



Immunization with Env Protein of AIDS Virus 

It is clear that in spite of the divergence observed
between HTLV-III and ARV-2 sequences, the
recombinant proteins synthesized by the bacteria
react well with AIDS patients' sera derived from
both geographical locations of the United States.
One hundred percent (100%) of the AIDS patients'
sera tested showed high reactivity (50 individual
samples, 25 from the east coast of the United
States and 25 from the west coast of the United
States). Thus all the env proteins contain at least
one conserved epitope. All of the human sera from
AIDS patients tested contained antibodies to the
env proteins of the instant invention. This strongly
suggests that these env proteins with the conserved
epitopes would be immunogenic in man.

It will be readily appreciated that the env proteins
of the instant invention can be incorporated
into vaccines capable of inducing protective immunity
against the AIDS virus. By methods known in
the art, the specific amino acids comprising the
epitopes of the env protein may be determined.
Peptides may then be synthesized, comprising an
amino acid sequence corresponding to an epitope
of an env AIDS protein either in monomeric or
multimeric form. These synthetic peptides may
then be incorporated into vaccines capable of inducing
protective immunity against AIDS virus.
Techniques for enhancing the antigenicity of such
peptides include incorporation into a multimeric
structure, binding to a highly immunogenic protein
carrier, for example, keyhole limpet hemocyanin or
diphtheria toxoid, and administration in combination
with adjuvants or any other enhancers of immune
response. In addition, the vaccine composition may
comprise antigens to provide immunity against other
diseases in addition to AIDS.

An amino acid sequence corresponding to an 
epitope of an env protein either in monomeric or 
multimeric form (peptide) may be obtained by 
chemical synthetic means or by purification from 
biological sources including genetically modified 
microorganisms or their culture media. The peptide 
may be combined in an amino acid sequence with 
other peptides including fragments of other pro- 
teins, as for example, when synthesized as a fusion 
protein, or linked to other antigenic or non-antigenic 
peptides of synthetic or biological origin. The term 
"corresponding to an epitope of an env protein" will
be understood to include the practical possibility 
that, in some instances, amino acid sequence vari- 
ations of a naturally occurring peptide may be 
antigenic and confer protective immunity against 
AIDS infection. Possible sequence variations in- 
clude, without limitation, amino acid substitutions, 
extensions, deletions, interpolations and combina- 
tions thereof. Such variations fall within the con- 
templated scope of the invention provided the pep- 
tide containing them is antigenic and antibodies 
elicited by such peptide cross-react with naturally 
occurring env protein or non-variant repeated pep- 
tides of env protein, to an extent sufficient to pro- 
vide protective immunity when administered as a 
vaccine. Such vaccine compositions will be com- 
bined with a physiologically acceptable medium. 
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METArg 

ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu 

METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr

HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn

METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys

ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr

AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr

SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn

AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer

PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr

PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal

SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr

AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn

AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly

AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer

LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu

IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer

ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 

ProCysArglleLysGlnPhelleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle 

SerGlyGlnlleArgCysSerSerAsnlleThrGiyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 

AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 

TyrLysValValLyslleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 

GluLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 

AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsn 

LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValtrpGlylleLysGlnLeuGln 

AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 

LysLeuIleCysThrThrAlaValProTrpAsnAiaSerTrpSerAsnLysSerLeuGluGlnlleTrp 

AsnHisThrThrTrpMETGluTrpAspArgGluILeAsnAsnTyrThrSerLeuIleHisSerteuIle 

GluGluSerGlnAsnGlrK31nGluLysAsnGluGlnGluLeuLei^luLeuAspLysTrpAlaS 

TrpAsnTrpPheAsnlleThrAsnTrpLeuTrpTyrlleLysLeuPhelleMETIleValGlyGlyLeu 

ValGlyLeuArglleValPheAlaValLeuSerValValAsnArgValArgGlnOlyTyrSerProLeu 

SerPheGlnThrHlsLeuProlleProArgGlyProAspArgProGluGLylleGluGluGluGlyGly 

GluArgAspArgAspArgSerlleArgLeuValAsnGlySerteuAlaLeuIleTrpAspAspLeuArg 

SerteuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArglleValGluLeu 

LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu 

LysAsnSerAlaValSerLeuteuAsnAlaThrAlalleAlaValAlaGluGlyThrAspArgvallle 

GluValValGlnGluAlaTyrArgAlalleArgHisIleProArgArglleArgGlnGlyLeuGluArg 

IleLeuLeu 



3. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 
Claims



1. An envelope protein of an acquired immune
deficiency syndrome (AIDS) virus comprising the
amino acid sequence:



ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu



or fragments thereof. 


2. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 
5. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence: 



METArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer



6. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 

METTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer



7. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence:

METArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer
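
The sequences in claims 1 to 7 are printed as unbroken runs of three-letter residue codes, with methionine written as MET. The following is a minimal sketch, assuming Python, of how such a run can be split into residues and rewritten in one-letter code. The helper name `to_one_letter` is hypothetical and is not part of the specification. A triplet that is not a recognised residue code raises an error, which is one way to catch transcription damage in a copied sequence.

```python
# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(run: str) -> str:
    """Convert a run such as 'METArgAsp' into one-letter code ('MRD')."""
    run = "".join(run.split())  # drop spaces and line breaks
    if len(run) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for i in range(0, len(run), 3):
        triplet = run[i:i + 3]
        # The claims print methionine as 'MET'; map that spelling to 'Met'.
        key = "Met" if triplet == "MET" else triplet
        if key not in THREE_TO_ONE:
            raise ValueError(
                f"unrecognised residue {triplet!r} at position {i // 3 + 1}"
            )
        residues.append(THREE_TO_ONE[key])
    return "".join(residues)


if __name__ == "__main__":
    # First line of the sequence recited in claim 7.
    print(to_one_letter("METArgAspAsnTrpArgSerGluLeuTyrLys"))
```

For the first line of the claim 7 sequence the sketch returns MRDNWRSELYK. The lookup is deliberately case-sensitive, so a misread such as a lowercase "lle" in place of "Ile" is reported instead of being silently accepted.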



8. An envelope protein as claimed in any one of
claims 1 to 7 that is a homogeneous protein free of
other AIDS viral proteins.

9. An expression vector comprising a gene coding
for an envelope protein of an AIDS virus downstream
of a promoter sequence enabling transcription,
translation and thus expression of said envelope
protein in a host cell.
ValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

4. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence:

PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer
TGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACC
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGG
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTC
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof. 

12. An expression vector according to claim 9 
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the 
nucleotide sequence: 



ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGG
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTC
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof.

13. An expression vector according to claim 9 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence:
10. An expression vector according to claim 9
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 



GTGTGGAAGGAAGCA
ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATC
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACC
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGG
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTC
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC


or an equivalent thereof. 

11. An expression vector according to claim 9 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the
nucleotide sequence: 






0 199 301 






ence of antibodies to the viral etiologic agent of
AIDS which comprises mixing a composition containing
an envelope protein of an AIDS virus as
claimed in any one of claims 1 to 8 with a sample
of human blood and determining whether said envelope
AIDS protein binds to AIDS antibodies
present in the blood sample.

30. A method according to claim 29 which comprises
the use of Western blotting analysis.

31. A method according to claim 29 which comprises
the use of an ELISA technique, wherein an
envelope protein of an AIDS virus as claimed in
any one of claims 1 to 8 is coated on a solid phase
and contacted with the sample and, after washing,
contacted with an enzyme-labeled non-human IgG.

32. A method according to claim 29, wherein the
Double-Antigen-Method is used.

33. A method for the determination of AIDS virus,
wherein antibodies against an envelope protein of
an AIDS virus according to any one of claims 1 to
8 are used.

34. A method according to claim 33, wherein the
antigen in the sample and a protein as claimed in
any one of claims 1 to 8 in labeled form compete
with an antibody against a protein as claimed in
any one of claims 1 to 8.

35. A method according to claim 33, wherein a
sandwich method is performed using two antibodies
against a protein as claimed in any one of
claims 1 to 8.

36. A method according to claim 35, wherein one
antibody is on a solid phase and the other antibody
is labeled.

37. A method according to claim 35, wherein two 
different monoclonal antibodies are used. 



38. A vaccine eliciting immunity to AIDS compris- 
ing as an active ingredient a protein as claimed in 
any one of claims 1 to 8. 

39. Antibodies raised against a protein as claimed 
in any one of claims 1 to 8. 

40. The antibodies of claim 39 which are
monoclonal antibodies.

41. The use of a protein as claimed in any one of
claims 1 to 8 for the preparation of a protective
immunisation vaccine.

42. The use of a protein as claimed in any one of 
claims 1 to 8 for testing human blood for the 
presence of AIDS virus. 

Claims for Austria: 

1. A process for the preparation of an envelope 
protein of an acquired immune deficiency syn- 
drome (AIDS) virus comprising: 

transforming a host cell with an expression vector
comprising a gene coding for an envelope protein
of an AIDS virus downstream of a promoter sequence
enabling transcription, translation and expression
of said envelope protein in said host cell;

culturing said host cell so that said envelope protein
of an AIDS virus is expressed; and

extracting and isolating said envelope protein of an 
AIDS virus. 

2. A process according to claim 1, characterized in 
that as a gene coding for an envelope protein of an 
AIDS virus a gene capable of effecting expression 
of the protein comprising the amino acid sequence 
of: 
ATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGG
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTC
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC
or an equivalent thereof. 

14. An expression vector according to claim 9
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:



ATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGG
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTC
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT



15. An expression vector according to any one of
claims 9 to 14 which is a plasmid capable of
replication in gram-negative and/or gram-positive
bacteria.

16. An expression vector according to claim 15
which is capable of replication in an E. coli strain.

17. An expression vector according to claim 15
which is capable of replication in a B. subtilis
strain.

18. An expression vector according to claim 15 or
16 which is a member of the pEV/env family.

19. An expression vector according to claim 18
which is pEV1, -2, or -3/env 44-640.

20. An expression vector according to claim 18
which is pEV1, -2, or -3/env 205-640.

21. A transformant carrying an expression vector as
claimed in any one of claims 9 to 20.

22. A transformant according to claim 21 which is
an E. coli strain.

23. A transformant according to claim 22 which is
an E. coli MC 1061 strain.

24. A transformant according to claim 21 which is a
B. subtilis strain.

25. A transformant according to claim 21 which is a
eucaryotic cell.

26. A method of producing an envelope protein of
an acquired immune deficiency syndrome virus as
claimed in any one of claims 1 to 8 comprising:

transforming a host cell with an expression vector
as claimed in any one of claims 9 to 20;

culturing said host cell so that said AIDS env
protein is expressed; and

extracting and isolating said AIDS env protein.

27. A method according to claim 26 wherein the
expression vector is pEV1, -2 or -3/env 44-640.

28. A method according to claim 26 wherein the
expression vector is pEV1, -2 or -3/env 205-640.

29. A method of testing human blood for the pres- 
ValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

is used.

4. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

CysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer
METArg
ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu



is used. 
3. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:






is used. 

8. A process according to any one of claims 1 to 7 
wherein the host cell is a bacterium. 

9. A process according to claim 8 wherein the 
bacterium is E. coli.

10. A process according to claim 9 wherein the 
expression vector is a plasmid of the pEV/env 
family. 

11. A process according to claim 10 wherein the 
plasmid is pEV1, -2, or -3/env 44-640.

12. A process according to claim 10 wherein the
plasmid is pEV1, -2, or -3/env 205-640.

13. A process for the preparation of an expression
vector comprising a gene coding for an envelope
protein of an AIDS virus, which process comprises
constructing an expression vector having an insertion
site wherein said gene may be inserted, which
insertion site is downstream of a promoter sequence
enabling transcription, translation and thus
expression of said envelope protein in a host cell.

14. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence



GTGTGGAAGGAAGCA
ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATC
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACC
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGG
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTC
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used. 

15. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide
sequence 
is used.

5. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

METArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

is used.

6. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

METTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

is used.

7. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

METArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer
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AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTG 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used. 



18. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide 
sequence 



ATGAGGGACAATTGGAGAAGTGAATTATATAAA 

TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG

AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT 



or an equivalent thereof is used. 

19. A process according to any one of claims 13 to
18 wherein the expression vector is a plasmid 
capable of replication in gram-negative bacteria. 

20. A process according to claim 19 wherein the 
plasmid is capable of replication in an E. coli strain. 

21. A process for the preparation of a transformant 
carrying an expression vector comprising a gene 
coding for an envelope protein of an AIDS virus, 
which process comprises transforming a microor- 
ganism with an expression vector obtained accord- 
ing to any one of claims 13 to 20 and cultivating 
the transformed microorganism. 

22. A process according to claim 21 wherein the 
microorganism is an E. coli strain. 

23. A process according to claim 22 wherein the 
microorganism is an E. coli MC 1061 strain. 

24. A process of testing human blood for the presence
of antibodies to the viral etiologic agent of AIDS
which process comprises mixing a composition
containing an envelope protein of an AIDS virus
obtained according to any one of claims 1 to 12
with a sample of human blood and determining
whether said envelope AIDS protein binds to AIDS
antibodies present in the blood sample.

25. A process according to claim 24 which comprises
the use of Western blotting analysis.

26. A process according to claim 24 which comprises
the use of an ELISA technique, wherein an
envelope protein of an AIDS virus obtained according
to any one of claims 1 to 12 is coated on a
solid phase and contacted with the sample and
after washing contacted with an enzyme-labeled
non-human IgG.

27. A process according to claim 24, wherein the
Double-Antigen-Method is used.

28. A process for the determination of AIDS virus,
wherein antibodies against an envelope protein of
an AIDS virus obtained according to any one of
claims 1 to 12 are used.

29. A process according to claim 28, wherein the
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TGTCCAAAGGTATCC 

TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used. 

16. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence

ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used.

17. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence
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or an equivalent thereof. 

39. An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the
nucleotide sequence: 

TGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof.

40. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC
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antigen in the sample and a protein obtained ac- other AIDS viral proteins 
cording to any one of claims 1 to 12 in labeled 

[...] with an antibody against a protein
obtained according to any one of claims 1 to 12.

30. A process according to claim 28, wherein a
sandwich method is performed using two antibodies
according to [...]

31. A method according to claim 30, wherein one
antibody is on a solid phase and the other antibody
is labeled.

32. A [...] according to claim [...], wherein two
different monoclonal antibodies are used.

33. An envelope protein of an AIDS virus whenever
prepared by a process as claimed in any one of
[...]

34. An envelope protein of an AIDS virus according
to claim 33 that is a homogeneous protein free of
other AIDS viral proteins.

35. An expression vector comprising a gene coding
for an envelope protein of an AIDS virus whenever
prepared by a process as claimed in any one of
claims 13 to 20.

36. A transformant carrying an expression vector
comprising a gene coding for an envelope protein
of an AIDS virus whenever prepared by a process
as claimed in any one of claims 21 to 23.

37. An expression vector comprising a gene coding
for an envelope protein of an AIDS virus downstream
of a promoter sequence enabling transcription,
translation and thus expression of said envelope
protein in a host cell.

38. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

GTGTGGAAGGAAGCA 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC
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54. The use of a protein as claimed in anyone of 
claims 1-8 for the preparation of a protective im- 
munisation vaccine. 
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or an equivalent thereof. 

41. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGTATGCCCCTCCCATC 

AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof.

42. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT



43. An expression vector according to any one of
claims 37 to 42 which is a plasmid capable of
replication in gram-negative bacteria.

44. An expression vector according to claim 43
which is capable of replication in an E. coli strain.

45. An expression vector according to claim 43 or
44 which is a member of the pEV/env family.

46. An expression vector according to claim 45
which is pEV1, -2, or -3/env 44-640.

47. An expression vector according to claim 45 
which is pEV1, -2, or -3/env 205-640. 

48. A transformant carrying an expression vector as 
claimed in any one of claims 37-47. 



49. A transformant according to claim 48 which is 
an E. coli strain. 

50. A transformant according to claim 49 which is 
an E. coli MC 1061 strain. 

51. Antibodies raised against a protein obtained
according to any one of claims 1 to 12 and 33 and
34.

52. The antibodies of claim 51 which are monoclonal
antibodies.

53. A vaccine eliciting immunity to AIDS comprising
as an active ingredient a protein obtained according
to any one of claims 1 to 12 and 33 and 34.
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I ATTCTGCAACAACTGCTGTTTATCCATTTTCAGAATTGGGTGTCGACATAGCAGAATAGGOGTTACTCG 69 
70 ACAGAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGC 138 
139 CTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAA 207
208 AAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGA 276
277 CTCATCAAGTTTCTCTATCAAAGCAGTAAGTAATACATGTAATGCAACCTATACAAATAGCAATAGTAG 345
346 CATTAGTAGTAGCAATAATAATAGCAATAGTTGTGTGGTCCATAGTAATCATAGAATATAGCAAAATAT 414
415 TAAGACAAAGAAAAATAGACAGGTTAATTGATAGACTAATAGAAAGAGCAGAAGACAGTGGCAATGAGA 483
484 GTGAAGGAGAAATATCAGCACTTGTGGAGATGGGGGTGGAGATGGGGCACCATGCTCCTTGGGATGTTG 552
553 ATGATCTGTAGTGCTACAGAAAAATTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAAGGAAGCA 621
622 ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA 690 
691 CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC 759 
760 ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG 828 
829 CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC 897
898 AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA 966
967 AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT 1035
1036 GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC 1104
1105 TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 1173
1174 TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 1242
1243 TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 1311
1312 GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 1380
1381 AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 1449 
1450 AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 1518 
1519 AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 1587 
1588 ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 1656 
1657 ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 1725 
1726 CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 1794 
1795 AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 1863
1864 AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 1932
1933 TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 2001
2002 GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 2070
2071 GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 2139
2140 TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 2208
2209 GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 2277
2278 AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 2346 
2347 AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAATACACTCCTTAATT 2415 
2416 GAAGAATCGCAAAACCAGCAAGAAAAGAATGAACAAGAATTATTGGAATTAGATAAATGGGCAAGTTTG 2484 
2485 TGGAATTGGTTTAACATAACAAATTGGCTGTGGTATATAAAATTATTCATAATGATAGTAGGAGGCTTG 2553 
2554 GTAGGTTTAAGAATAGTTTTTGCTGTACTTTCTGTAGTGAATAGAGTTAGGCAGGGATATTCACCATTA 2622 
2623 TCGTTTCAGACCCACCTCCCAATCCCGAGGGGACCCGACAGGCCCGAAGGAATAGAAGAAGAAGGTGGA 2691
2692 GAGAGAGACAGAGACAGATCCATTCGATTAGTGAACGGATCCTTAGCACTTATCTGGGACGATCTGCGG 2760
2761 AGCCTGTGCCTCTTCAGCTACCACCGCTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTGGAACTT 2829
2830 CTGGGACGCAGGGGGTGGGAAGCCCTCAAATATTGGTGGAATCTCCTACAATATTGGAGTCAGGAGCTA 2898
2899 AAGAATAGTGCTGTTAGCTTGCTCAATGCCACAGCTATAGCAGTAGCTGAGGGGACAGATAGGGTTATA 2967
2968 GAAGTAGTACAAGAAGCTTATAGAGCTATTCGCCACATACCTAGAAGAATAAGACAGGGCTTGGAAAGG 3036
3037 ATTTTGCTATAAGATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTGCTGTAAGGGAAAG 3105
3106 AATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGA 3156 
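The reading frame of the envelope gene in the sequence above can be checked by translating a stretch of it with the standard genetic code. The following is a minimal Python sketch and is not part of the original disclosure: the function name `translate` and the table layout are assumptions made for illustration, and the fragment is the 33-nucleotide stretch beginning ATGAGGGAC that closes the line ending at position 1932.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Illustrative fragment copied from the sequence listing (ends at 1932).
fragment = "ATGAGGGACAATTGGAGAAGTGAATTATATAAA"
print(translate(fragment))  # MRDNWRSELYK
```

The one-letter result corresponds to METArgAspAsnTrpArgSerGluLeuTyrLys in the three-letter notation used in the claims.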
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50 

HXB-3 MRVKEKYQHLWRWGWRWGTMLLGMLMICSATEKLWVTVYYGVPVWKEATT

BH-8 y 
LAV K I 
ARV-2 K --GTRRN 



51 100

HXB-3 TLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLVNVTENFNMWKNDM

BH-10 

BH-8 

LAV 

ARV-2 R G N 



101 150

HXB-3 VEQMHEDIISLWDQSLKPCVKLTPLCVSLKCTDLKNDTNTNSSSGRMIME

BH-10 
BH-8 

LAV G A NTNSS E M 
ARV-2 Q T N G A NWKEEI 



151 200

HXB-3 KGEIKNCSFNISTSIRGKVQKEYAFFYKLDIIPIDND--TTSYTLTS---CNTSV
BH-10 

BH-8 K 
LAV 

ARV-2 T D I N L RN W AST N NYRLIH R 

201 250 

HXB-3 ITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHG

BH-10 

BH-8 

LAV A 
ARV-2 T K 

251 300 

HXB-3 IRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTRPN

BH-10 A Q 

BH-8 D 

LAV A Q 

ARV-2 I D N B A 




301 350 

HXB-3 NNTRKKIRIQRGPGRAFVTIGKIGNMRQ-AHCNISRAKWNATLKQIASKLR
BH-10 S N D 

BH-8 D 
LAV S 

ARV-2 S Y H T R 1GDIRK Q N E VK 



351 400 

HXB-3 EQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTW

BH-10 

BH-8 

LAV 

ARV-2 VN MR T N -RLNH 



401 450 

HXB-3 STEGSNNTEGSDTITLPCRIKQFINMWQEVGKAMYAPPISGQIRCSSNIT

BH-10 K I 

BH-8 K I 
LAV 

ARV-2 K N I I G S 



451 500 

HXB-3 GLLLTRDGG-NNNNGSEIFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTK
BH-10 - S E 

BH-8 - S E 

LAV - 

ARV-2 TVTDTV II 



501 550 

HXB-3 AKRRVVQREKRAVGI-GALFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQ

BH-10 

BH-8 

LAV - R 

ARV-2 V M V L 

551 600 

HXB-3 QQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWGCSG

BH-10 G 

BH-8 

LAV 

ARV-2 V R 
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601 650 

HXB-3 KLICTTAVPWNASWSNKSLEQIWNHTTWMEWDREINNYTSLIHSLIEESQ
BH-10 MM 

BH-8 NM 

LAV NM 

ARV-2 D DNM Q E D NT YT 



651 700

HXB-3 NQQEKNEQELLELDKWASLWNWFNITNWLWYIKLFIMIVGGLVGLRIVFA
BH-10 

BH-8 

LAV I 
ARV-2 SI 



701 750

HXB-3 VLSVVNRVRQGYSPLSFQTHLPIPRGPDRPEGIEEEGGERDRDRSIRLVN
BH-10 

BH-8 I N 

LAV I T 

ARV-2 I R V D V D 



751 800 

HXB-3 GSLALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWNLL

BH-10 

BH-8 

LAV 

ARV-2 F E R AA T I H S 



801 850 

HXB-3 QYWSQELKNSAVSLLNATAIAVAEGTDRVIEVVQEAYRAIRHIPRRIRQG

BH-10 . G 

BH-8 N LA 

LAV G C 

ARV-2 I W T A R L H 

851 856 

HXB-3 LERILL 

BH-10 

BH-8 

LAV 

ARV-2 L 



" - " designates a deletion of one amino acid. An empty space 
denotes identity with the HXB-3 sequence.
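The legend's convention (a blank column means the same residue as HXB-3, "-" means a deleted residue) can be applied mechanically to recover a full variant sequence. The sketch below is illustrative only: the helper names `expand_row` and `ungapped` are hypothetical, and the variant row is invented for the example rather than taken from the figure. A real row must keep its original column positions for the expansion to be meaningful.

```python
def expand_row(reference: str, row: str) -> str:
    """Fill blank columns of an alignment row with the reference residue."""
    padded = row.ljust(len(reference))  # trailing blanks also mean identity
    return "".join(ref if col == " " else col
                   for ref, col in zip(reference, padded))


def ungapped(aligned: str) -> str:
    """Drop '-' deletion markers to obtain the actual residue sequence."""
    return aligned.replace("-", "")


reference = "MRVKEKYQ"   # first eight HXB-3 residues of the alignment
variant_row = "   R  -"  # hypothetical row, for illustration only
full = expand_row(reference, variant_row)
print(full)            # MRVREK-Q
print(ungapped(full))  # MRVREKQ
```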
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Figure 5 
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FIGURE 6A 



METArg
ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu
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FIGURE 6B 

AMINO ACID DISTRIBUTION OF AIDS ENV PROTEIN

A  Alanine                     47
B  Aspartic Acid-Asparagine     0
C  Cysteine                    21
D  Aspartic Acid               27
E  Glutamic Acid               49
F  Phenylalanine               26
G  Glycine                     58
H  Histidine                   14
I  Isoleucine                  53
K  Lysine                      44
L  Leucine                     83
M  Methionine                  17
N  Asparagine                  60
P  Proline                     29
Q  Glutamine                   42
R  Arginine                    52
S  Serine                      57
T  Threonine                   60
V  Valine                      56
W  Tryptophan                  31
Y  Tyrosine                    20
Z  Glutamine-Glutamic Acid      0
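A distribution of this kind can be tallied directly from a one-letter protein sequence. The following Python sketch shows one way to do so; it is an illustration under stated assumptions, not the procedure used to prepare the figure. The function name `composition` is hypothetical, and the input is a short env fragment chosen for the example rather than the full 856-position sequence.

```python
from collections import Counter

# One-letter codes in the order used by the table; B and Z are the
# ambiguity codes (Asx, Glx) and stay at zero for a fully determined sequence.
CODES = "ABCDEFGHIKLMNPQRSTVWYZ"


def composition(protein: str) -> dict:
    """Return the count of every code in CODES, including zeros."""
    counts = Counter(protein.upper())
    return {code: counts.get(code, 0) for code in CODES}


fragment = "MRDNWRSELYK"  # short env fragment, for illustration only
table = composition(fragment)
print(table["R"], table["G"])  # 2 0
```

Summing the returned values gives the sequence length, which is a quick consistency check on any table produced this way.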
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